In this notebook we analyze the first “think/believe” task, in which participants completed a series of fill-in-the-blanks by choosing between two options: “think” and “believe.”
Overview
From the preregistration (link):
“Our overarching hypothesis for the present study is that […] other languages will have an epistemic verb that is more likely to be used for religious attitude reports (similar to English “believe”) and a different epistemic verb that is more likely to be used for matter-of-fact attitude reports (similar to English “think”).
For this study, we are examining five languages in five regions of interest: (i) Mandarin in China; (ii) Thai in Thailand; (iii) Bislama (an English-based creole language) on the Melanesian Island of Vanuatu; (iv) Fante in Ghana; and (v) American English in the Bay Area, California.
We thus have five more specific sub-hypotheses. For each of the first four languages / regions of interest, we hypothesize that a set of words or phrases exists whose usage parallels the difference between usage of “think” and “believe” in American English, with one word or phrase (the “think” analogue) being used for more matter-of-fact attitude reports and the other (the “believe” analogue) being more likely to be used for religious attitude reports. That gives us our first four sub-hypotheses: that Mandarin, Thai, Bislama and Fante speakers will each use two different words in a manner parallel to the use of “think” and “believe” in an American English setting as identified by Heiphetz, Landers, and Van Leeuwen. Our fifth sub-hypothesis is that the Bay Area portion of the study will replicate the results of the earlier study of Heiphetz, Landers, and Van Leeuwen."
KW EXECUTIVE SUMMARY (2020-01-19): We replicated the original finding in the US: participants were more likely to circle “believe” for religious than fact questions. We found the same pattern in all five countries/languages included in this study.
The pattern was somewhat weaker in Ghana/Fante than in other countries/languages (though it was still significant), and the pattern was somewhat stronger in Thailand/Thai than in other countries/languages.
Analysis: KW without looking at preregistration
Here’s how I analyzed the data before looking at the preregistration. I think these analyses are valuable because they’re a little more efficient than the preregistered analyses (no need for follow-up tests) and they directly test the question of whether the effect of interest varies across countries/languages.
Technical note: Unless specified otherwise, all of these analyses use “effect coding” for categorical variables (e.g., country, category of question), meaning that each country/language is compared to the “grand mean” collapsing across all countries/languages. Because five countries yield only four degrees of freedom, each analysis compares only 4 of the 5 countries to the grand mean. By default, I’ve left out the comparison of the US/English to the grand mean, but stats for that comparison could easily be calculated (by leaving out another country/language instead). This is just to say that you won’t see statements like “The effect was exaggerated in the US relative to other countries,” although they might be true.
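To make the effect-coding note concrete, here is a minimal sketch in base R using simulated data. It assumes sum-to-zero contrasts from `contr.sum()`, which may differ in detail from the `contrast_country` matrix defined in the project's scripts (not shown here); the data frame `toy` and its values are made up for illustration. It shows that the intercept is the unweighted mean of the country means, and that the deviation for the omitted country can be recovered as minus the sum of the other four coefficients.

```r
# Toy data: hypothetical per-participant proportions, NOT the study data
set.seed(1)
countries <- c("Ghana", "Thailand", "China", "Vanuatu", "US")
toy <- data.frame(
  country = factor(rep(countries, each = 20), levels = countries),
  prop_believe = runif(100)
)

# Sum-to-zero ("effect") coding: contr.sum() omits the LAST level (here, US)
contrasts(toy$country) <- contr.sum(5)
fit <- lm(prop_believe ~ country, data = toy)
coefs <- coef(fit)

# The intercept is the unweighted mean of the five country means
group_means <- sapply(split(toy$prop_believe, toy$country), mean)
grand_mean <- mean(group_means)

# The omitted US deviation is minus the sum of the other four deviations
us_dev <- -sum(coefs[-1])

print(round(c(intercept = unname(coefs[1]), grand_mean = grand_mean), 4))
print(round(c(us_dev = us_dev, us_minus_grand = group_means[["US"]] - grand_mean), 4))
```

The same logic is why a statement about the US relative to the grand mean can be derived from a model that never reports a US coefficient directly.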
KW Analysis #1
First, I used a mixed effects linear regression (fit with lmer, so the coefficients below are on the probability scale rather than the log-odds scale of a logistic model) predicting whether a participant circled “believe” based on the superordinate category of the question (“religious” questions or “fact” questions), the country they were in/language they were using (US/English, Ghana/Fante, Thailand/Thai, China/Mandarin, or Vanuatu/Bislama), and an interaction between them, with a maximal random effects structure (random intercepts and slopes by subject, and random intercepts by question). This analysis gives me a sense of (1) whether participants were more likely to circle “believe” for religious questions than fact questions, and (2) whether this tendency varied by country/language, controlling for the fact that the overall rates of circling “believe” might vary by country/language (and accounting for individual differences and differences across individual questions).
r1.1 <- lmer(believe ~ super_cat * country
+ (1 + super_cat | thb1_subj) + (1 | question),
data = d1_long)
| Parameter | β | β' | β'' | Std. Err. | df | t | p | |
|---|---|---|---|---|---|---|---|---|
| Intercept | 0.60 | - | - | 0.03 | 24.00 | 23.59 | <0.001 | *** |
| Category (religious) | 0.20 | 0.39 | 0.39 | 0.03 | 26.58 | 7.61 | <0.001 | *** |
| Country (Gh.) | 0.09 | 0.10 | 0.10 | 0.01 | 338.99 | 6.71 | <0.001 | *** |
| Country (Th.) | -0.03 | -0.04 | -0.04 | 0.01 | 338.99 | -2.44 | 0.015 | * |
| Country (Ch.) | -0.03 | -0.03 | -0.03 | 0.01 | 338.99 | -2.14 | 0.033 | * |
| Country (Vt.) | -0.01 | -0.02 | -0.02 | 0.01 | 338.99 | -1.23 | 0.221 | |
| Category (religious) × Country (Gh.) | -0.10 | -0.13 | -0.13 | 0.02 | 339.01 | -5.67 | <0.001 | *** |
| Category (religious) × Country (Th.) | 0.07 | 0.09 | 0.09 | 0.02 | 339.01 | 4.27 | <0.001 | *** |
| Category (religious) × Country (Ch.) | -0.01 | -0.01 | -0.01 | 0.02 | 339.01 | -0.55 | 0.585 | |
| Category (religious) × Country (Vt.) | 0.00 | 0.01 | 0.01 | 0.01 | 339.01 | 0.33 | 0.744 | |
The effects of primary interest are in bold:
- Category (religious): Collapsing across countries/languages, participants were indeed more likely to say “believe” for “religious” questions. This effect is much larger than any of the differences across countries/languages (as you can see by comparing the β values for different effects).
- Country (Gh.): Participants in Ghana were generally more likely than other participants to say “believe,” collapsing across question categories.
- Country (Th.): Participants in Thailand were generally less likely than other participants to say “believe,” collapsing across question categories.
- Country (Ch.): Participants in China were slightly less likely than other participants to say “believe,” collapsing across question categories (a small difference, p = 0.033).
- Country (Vt.): Participants in Vanuatu were no more or less likely than other participants to say “believe,” collapsing across question categories.
- Category (religious) x Country (Gh.): The difference in rates of “believe” responses between question categories was smaller in Ghana than in other countries.
- Category (religious) x Country (Th.): The difference in rates of “believe” responses between question categories was larger in Thailand than in other countries.
- Category (religious) x Country (Ch.): The difference in rates of “believe” responses between question categories was no smaller or larger in China than in other countries.
- Category (religious) x Country (Vt.): The difference in rates of “believe” responses between question categories was no smaller or larger in Vanuatu than in other countries.
Take-away: The predicted effect is evident in this dataset. It appears to be exaggerated in Thailand and diminished in Ghana.
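One way to read the β for Category (religious): if `super_cat` is coded +1 for religious and -1 for fact (an assumption; `contrast_super_cat` is defined in a script not shown here), then the coefficient equals half the gap between the two categories. The sketch below checks this with a plain `lm()` on simulated data, ignoring the random effects for simplicity; it is not the model reported above.

```r
# Simulated binary responses (hypothetical values, not the study data)
set.seed(3)
toy <- data.frame(
  super_cat = factor(rep(c("religious", "fact"), each = 50),
                     levels = c("religious", "fact")),
  believe = c(rbinom(50, 1, 0.8), rbinom(50, 1, 0.4))
)

# Assumed coding: religious = +1, fact = -1
contrasts(toy$super_cat) <- cbind(religious = c(1, -1))

# Plain linear model as a stand-in for the mixed model
fit <- lm(believe ~ super_cat, data = toy)
slope <- unname(coef(fit)[2])

# Raw gap between the two category means
gap <- mean(toy$believe[toy$super_cat == "religious"]) -
  mean(toy$believe[toy$super_cat == "fact"])

print(c(slope = slope, half_gap = gap / 2))
```

Under that assumption, a β of 0.20 would correspond to a gap of roughly 0.40 between categories, in line with the paired t-test mean difference (about 0.42) reported later in this notebook.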
KW Analyses #1a-1e (by country)
Next, I did this same analysis within each country/language alone (using the most maximal random effects structure that converged across all countries/languages).
# note: using most maximal common random effects structure
r1.1_us <- lmer(believe ~ super_cat +
# (1 + super_cat | thb1_subj) + (1 | question),
# (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
(1 | thb1_subj) + (1 | question),
# (1 + super_cat | thb1_subj),
data = d1_long %>% filter(country == "US"))
r1.1_gh <- lmer(believe ~ super_cat +
# (1 + super_cat | thb1_subj) + (1 | question),
# (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
(1 | thb1_subj) + (1 | question),
# (1 + super_cat | thb1_subj),
data = d1_long %>% filter(country == "Ghana"))
r1.1_th <- lmer(believe ~ super_cat +
# (1 + super_cat | thb1_subj) + (1 | question),
# (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
(1 | thb1_subj) + (1 | question),
# (1 + super_cat | thb1_subj),
data = d1_long %>% filter(country == "Thailand"))
r1.1_ch <- lmer(believe ~ super_cat +
# (1 + super_cat | thb1_subj) + (1 | question),
# (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
(1 | thb1_subj) + (1 | question),
# (1 + super_cat | thb1_subj),
data = d1_long %>% filter(country == "China"))
r1.1_vt <- lmer(believe ~ super_cat +
# (1 + super_cat | thb1_subj) + (1 | question), # failed to converge
# (1 + super_cat || thb1_subj) + (1 | question),
(1 | thb1_subj) + (1 | question),
# (1 + super_cat | thb1_subj),
data = d1_long %>% filter(country == "Vanuatu"))
| Country | Parameter | β | Std. Err. | df | t | p | |
|---|---|---|---|---|---|---|---|
| US | Intercept | 0.58 | 0.02 | 26.45 | 26.45 | <0.001 | *** |
| US | Category (religious) | 0.24 | 0.02 | 23.00 | 11.51 | <0.001 | *** |
| Ghana | Intercept | 0.69 | 0.04 | 30.42 | 18.86 | <0.001 | *** |
| Ghana | Category (religious) | 0.09 | 0.03 | 23.00 | 2.80 | 0.010 | * |
| Thailand | Intercept | 0.57 | 0.03 | 29.64 | 20.37 | <0.001 | *** |
| Thailand | Category (religious) | 0.27 | 0.03 | 23.00 | 10.08 | <0.001 | *** |
| China | Intercept | 0.57 | 0.03 | 31.80 | 22.90 | <0.001 | *** |
| China | Category (religious) | 0.19 | 0.02 | 23.00 | 8.55 | <0.001 | *** |
| Vanuatu | Intercept | 0.59 | 0.04 | 23.38 | 14.26 | <0.001 | *** |
| Vanuatu | Category (religious) | 0.20 | 0.04 | 23.00 | 4.95 | <0.001 | *** |
The effects of primary interest are in bold, and the take-away is clear: In every country/language, participants were more likely to say “believe” in “religious” questions than in “fact” questions.
KW Analysis #2
In this analysis, I treated country/language as a random rather than fixed effect (with participants nested within countries). (Note that I had to use a simpler random effects structure in order to get the model to converge.)
r1.2 <- lmer(believe ~ super_cat
# + (1 + super_cat | country/thb1_subj) + (1 | question), # failed to converge
# + (1 + super_cat || country/thb1_subj) + (1 | question), # failed to converge
# + (1 + super_cat | country/thb1_subj), # failed to converge
+ (1 | country/thb1_subj) + (1 | question),
data = d1_long)
| Parameter | β | Std. Err. | df | t | p | |
|---|---|---|---|---|---|---|
| Intercept | 0.60 | 0.04 | 11.69 | 16.43 | <0.001 | *** |
| Category (religious) | 0.21 | 0.03 | 23.00 | 8.26 | <0.001 | *** |
The effect still holds.
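For readers unfamiliar with the `country/thb1_subj` syntax: in R formulas, `a/b` is shorthand for `a + a:b`, so `(1 | country/thb1_subj)` requests random intercepts for countries and for participants within countries. The base R snippet below shows that expansion using `terms()`; it only inspects the formula and does not fit any model.

```r
# Expand the nesting operator with base R's formula machinery (no model is fit)
nested <- terms(~ country/thb1_subj)
expanded <- attr(nested, "term.labels")
print(expanded)
```

With only five countries, the country-level variance is estimated from very few groups, which may be part of why the richer random effects structures in the call above failed to converge.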
KW Analysis #3
Finally, I ran a version of this first model looking at 5 categories of questions (rather than 2 superordinate categories): Christian religious, Buddhist religious, well-known fact, esoteric fact, and personal fact. I compared these categories using planned orthogonal contrasts.
r1.3 <- lmer(believe ~ category * country
# + (1 + category | thb1_subj) + (1 | question), # failed to converge
+ (1 + category || thb1_subj) + (1 | question),
# + (1 + category | thb1_subj),
# + (1 + category || thb1_subj),
# + (1 | thb1_subj) + (1 | question),
data = d1_long)
| Parameter | β | Std. Err. | df | t | p | |
|---|---|---|---|---|---|---|
| Intercept | | | | | | |
| Intercept | 0.56 | 0.02 | 23.31 | 34.07 | <0.001 | *** |
| Category comparisons | | | | | | |
| Category (Religious vs. fact) | 0.08 | 0.01 | 28.13 | 11.27 | <0.001 | *** |
| Category (Christian vs. Buddhist religious) | 0.06 | 0.03 | 20.55 | 2.44 | 0.024 | * |
| Category (well-known & esoteric vs. personal fact) | 0.06 | 0.01 | 21.75 | 3.90 | <0.001 | *** |
| Category (well-known vs. esoteric fact) | 0.07 | 0.03 | 22.30 | 2.67 | 0.014 | * |
| Country comparisons | | | | | | |
| Country (Gh.) | 0.11 | 0.01 | 348.32 | 7.83 | <0.001 | *** |
| Country (Th.) | -0.04 | 0.01 | 348.32 | -3.44 | <0.001 | *** |
| Country (Ch.) | -0.03 | 0.01 | 348.32 | -1.87 | 0.062 | |
| Country (Vt.) | -0.01 | 0.01 | 348.32 | -1.24 | 0.215 | |
| Interactions: Ghana | | | | | | |
| Category (Religious vs. fact) × Country (Gh.) | -0.04 | 0.01 | 339.31 | -5.66 | <0.001 | *** |
| Category (Christian vs. Buddhist religious) × Country (Gh.) | 0.05 | 0.02 | 992.01 | 2.85 | 0.004 | ** |
| Category (well-known & esoteric vs. personal fact) × Country (Gh.) | -0.03 | 0.01 | 341.34 | -2.66 | 0.008 | ** |
| Category (well-known vs. esoteric fact) × Country (Gh.) | 0.02 | 0.02 | 340.19 | 1.24 | 0.214 | |
| Interactions: Thailand | | | | | | |
| Category (Religious vs. fact) × Country (Th.) | 0.03 | 0.01 | 339.31 | 4.26 | <0.001 | *** |
| Category (Christian vs. Buddhist religious) × Country (Th.) | -0.05 | 0.01 | 992.01 | -3.86 | <0.001 | *** |
| Category (well-known & esoteric vs. personal fact) × Country (Th.) | 0.02 | 0.01 | 341.34 | 2.26 | 0.024 | * |
| Category (well-known vs. esoteric fact) × Country (Th.) | -0.02 | 0.02 | 340.19 | -1.15 | 0.252 | |
| Interactions: China | | | | | | |
| Category (Religious vs. fact) × Country (Ch.) | 0.00 | 0.01 | 339.31 | -0.55 | 0.585 | |
| Category (Christian vs. Buddhist religious) × Country (Ch.) | -0.04 | 0.02 | 992.01 | -2.48 | 0.013 | * |
| Category (well-known & esoteric vs. personal fact) × Country (Ch.) | -0.04 | 0.01 | 341.34 | -3.29 | 0.001 | ** |
| Category (well-known vs. esoteric fact) × Country (Ch.) | -0.05 | 0.02 | 340.19 | -2.49 | 0.013 | * |
| Interactions: Vanuatu | | | | | | |
| Category (Religious vs. fact) × Country (Vt.) | 0.00 | 0.01 | 339.31 | 0.33 | 0.745 | |
| Category (Christian vs. Buddhist religious) × Country (Vt.) | 0.06 | 0.01 | 992.01 | 4.66 | <0.001 | *** |
| Category (well-known & esoteric vs. personal fact) × Country (Vt.) | 0.04 | 0.01 | 341.34 | 4.50 | <0.001 | *** |
| Category (well-known vs. esoteric fact) × Country (Vt.) | 0.06 | 0.02 | 340.19 | 3.74 | <0.001 | *** |
The first orthogonal contrast compared the two “religious” categories to the three “fact” categories (“Category (Religious vs. fact)”). This parallels the previous analyses, and the results are similar: Overall, participants were more likely to circle “believe” for religious questions than fact questions, and this tendency was diminished in Ghana and exaggerated in Thailand.
The second orthogonal contrast compared Christian to Buddhist “religious” questions. Overall, participants were more likely to circle “believe” for Christian questions, and this tendency was exaggerated in Ghana and Vanuatu (which were predominantly Christian samples) and diminished in Thailand and China (which were more Buddhist samples).
The third orthogonal contrast compared well-known and esoteric facts, on the one hand, to personal facts, on the other. Overall, participants were more likely to circle “believe” for well-known and esoteric facts, and this tendency was diminished in Ghana and China, and exaggerated in Thailand and Vanuatu.
The fourth orthogonal contrast compared well-known to esoteric facts. Overall, participants were more likely to circle “believe” for well-known facts, and this tendency was diminished in China and exaggerated in Vanuatu.
Note that these findings statistically control for differences across samples in the overall rate of circling “believe” (which was generally higher in Ghana and lower in Thailand).
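The four planned contrasts described above can be written as a contrast matrix. The sketch below is one possible set of weights, using the category order from the `category` factor; the actual `contrast_category_orth` object lives in a script not shown here and may be scaled differently, so `contrast_sketch` and its column names are hypothetical. The check at the end confirms that the columns are mutually orthogonal and each sums to zero.

```r
# Category order as in the `category` factor levels
categories <- c("Christian religious", "Buddhist religious",
                "well-known fact", "esoteric fact", "life fact")

# Hypothetical weights for the four planned contrasts (scaling is an assumption)
contrast_sketch <- cbind(
  religious_vs_fact          = c(3,  3, -2, -2, -2),
  christian_vs_buddhist      = c(1, -1,  0,  0,  0),
  known_esoteric_vs_personal = c(0,  0,  1,  1, -2),
  known_vs_esoteric          = c(0,  0,  1, -1,  0)
)
rownames(contrast_sketch) <- categories

# Orthogonality: every pair of columns has a zero dot product
cross <- crossprod(contrast_sketch)
off_diag <- cross[upper.tri(cross)]

# Each contrast also sums to zero across categories
col_sums <- colSums(contrast_sketch)

print(contrast_sketch)
print(off_diag)
print(col_sums)
```

Because the contrasts are orthogonal, each coefficient can be read on its own without adjusting for the others, which is what allows the four comparisons to be interpreted separately.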
Analysis: Based on preregistration
From preregistration:
“Survey 1: We will conduct a 5 (Site: China vs. Thailand vs. Vanuatu vs. Ghana vs. United States) x 2 (Statement Type: religion vs. fact) mixed ANOVA with repeated measures on the second factor and the proportion of trials on which participants completed sentences using a form the word “believe” (or its respective translation) as the dependent measure. To look for finer-grained differences between different religious and factual statements, we will also conduct a 5 (Site: China vs. Thailand vs. Vanuatu vs. Ghana vs. United States) x 5 (Statement Type: Buddhist religious statements vs. Christian religious statements vs. life facts vs. well-known facts vs. esoteric facts) mixed ANOVA with repeated measures on the second factor and the proportion of trials on which participants completed sentences using a form of the word “believe” (or its respective translation) as the dependent measure. In all cases where omnibus ANOVAs are significant, we will conduct pairwise analyses comparing each statement type with each other statement type and each site with each other site."
d1_anova <- d1_long %>%
distinct(thb1_subj, country, super_cat, question, believe) %>%
group_by(thb1_subj, country, super_cat) %>%
summarise(prop_believe = mean(believe)) %>%
ungroup() %>%
mutate(thb1_subj = factor(thb1_subj))
contrasts(d1_anova$country) <- contrast_country
contrasts(d1_anova$super_cat) <- contrast_super_cat
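As an illustration of the aggregation step, the following base R sketch collapses trial-level TRUE/FALSE responses into one proportion per participant and category, which is the dependent measure the preregistration describes. The data frame `toy_long` and its values are made up; the notebook itself does this with dplyr.

```r
# Hypothetical trial-level data: 2 participants x 2 categories x 2 questions
toy_long <- data.frame(
  subj = rep(c("s1", "s2"), each = 4),
  super_cat = rep(rep(c("religious", "fact"), each = 2), times = 2),
  believe = c(TRUE, TRUE, FALSE, TRUE,    # s1: religious, religious, fact, fact
              TRUE, FALSE, FALSE, FALSE)  # s2: religious, religious, fact, fact
)

# The mean of a logical vector is the proportion of TRUE values
toy_prop <- aggregate(believe ~ subj + super_cat, data = toy_long, FUN = mean)
names(toy_prop)[names(toy_prop) == "believe"] <- "prop_believe"

print(toy_prop)
```

Each participant then contributes exactly one row per category, which is the shape a repeated-measures ANOVA or paired t-test expects.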
Prereg Analysis #1
Here is the first preregistered analysis: a 5 (country) x 2 (question category) mixed ANOVA with repeated measures on the second factor and the proportion of trials on which participants circled “believe” as the DV.
r1.4 <- d1_anova %>%
anova_test(dv = prop_believe,
wid = thb1_subj,
between = country,
within = super_cat)
get_anova_table(r1.4)
ANOVA Table (type III tests)
Effect DFn DFd F p p<.05 ges
1 country 4 339 11.390 1.10e-08 * 0.043
2 super_cat 1 339 580.443 1.94e-75 * 0.533
3 country:super_cat 4 339 11.224 1.46e-08 * 0.081
This analysis aligns with the regressions above, suggesting that participants’ tendency to circle “believe” varied by country/language (country) and by question category (super_cat), and the difference between question categories varied across countries/languages (i.e., there was an interaction: country:super_cat).
The preregistration indicated that we’d conduct pairwise follow-up analyses comparing the two question categories and comparing pairs of countries/languages, but I don’t really think we’re interested in comparing pairs of countries/languages, so I’m going to skip that for now. Instead, I’ll compare the two question categories within each country/language (to explore the significant interaction).
Here we go:
Comparing question categories
r1.5a <- t.test(prop_believe ~ super_cat, paired = T, d1_anova); r1.5a
Paired t-test
data: prop_believe by super_cat
t = 24.783, df = 343, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
0.3836875 0.4498396
sample estimates:
mean of the differences
0.4167636
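Collapsing across countries/languages, participants circled significantly more “believe” responses for questions in the religious category than for questions in the fact category: on average, the proportion of “believe” responses was about 42 percentage points higher for religious questions (mean difference = 0.42, 95% CI [0.38, 0.45]).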
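Note that the estimate a paired t-test reports is the mean of the within-participant differences, not the mean of either category. The sketch below illustrates this on simulated proportions (the vectors `religious` and `fact` are made up): a paired test on two columns gives the same result as a one-sample test on their difference. It uses the two-vector interface to `t.test()` rather than the formula interface.

```r
# Simulated per-participant proportions (hypothetical values)
set.seed(2)
religious <- runif(30, min = 0.6, max = 1.0)
fact <- runif(30, min = 0.2, max = 0.7)

# Paired t-test on the two columns
paired <- t.test(religious, fact, paired = TRUE)

# One-sample t-test on the within-participant differences
one_sample <- t.test(religious - fact)

# Both report the same t statistic, and the estimate is the mean difference
print(c(paired_t = unname(paired$statistic),
        one_sample_t = unname(one_sample$statistic)))
print(c(estimate = unname(paired$estimate),
        mean_of_differences = mean(religious - fact)))
```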
Comparing question categories within countries/languages
# US
r1.5b_us <- t.test(prop_believe ~ super_cat, paired = T,
d1_anova %>% filter(country == "US")); r1.5b_us
Paired t-test
data: prop_believe by super_cat
t = 11.179, df = 75, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
0.3986478 0.5715276
sample estimates:
mean of the differences
0.4850877
# Ghana
r1.5b_gh <- t.test(prop_believe ~ super_cat, paired = T,
d1_anova %>% filter(country == "Ghana")); r1.5b_gh
Paired t-test
data: prop_believe by super_cat
t = 6.6368, df = 47, p-value = 2.909e-08
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
0.1316330 0.2461448
sample estimates:
mean of the differences
0.1888889
# Thailand
r1.5b_th <- t.test(prop_believe ~ super_cat, paired = T,
d1_anova %>% filter(country == "Thailand")); r1.5b_th
Paired t-test
data: prop_believe by super_cat
t = 16.586, df = 74, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
0.4669138 0.5944196
sample estimates:
mean of the differences
0.5306667
# China
r1.5b_ch <- t.test(prop_believe ~ super_cat, paired = T,
d1_anova %>% filter(country == "China")); r1.5b_ch
Paired t-test
data: prop_believe by super_cat
t = 7.3784, df = 47, p-value = 2.186e-09
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
0.2747757 0.4807798
sample estimates:
mean of the differences
0.3777778
# Vanuatu
r1.5b_vt <- t.test(prop_believe ~ super_cat, paired = T,
d1_anova %>% filter(country == "Vanuatu")); r1.5b_vt
Paired t-test
data: prop_believe by super_cat
t = 17.054, df = 96, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
0.3598187 0.4546143
sample estimates:
mean of the differences
0.4072165
The difference between question categories was significant in each country/language considered alone.
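The five per-country tests above repeat the same call with a different filter. As a sketch of how they could be produced in one pass, the following base R code splits a data frame by country and collects the key statistics into one table. The data are simulated, the wide layout (one `religious` and one `fact` column per participant) is an assumption made for the example, and the names `toy` and `by_country` are hypothetical.

```r
# Simulated wide-format data: one row per participant (hypothetical values)
set.seed(4)
toy <- data.frame(
  country = rep(c("A", "B"), each = 20),
  religious = runif(40, min = 0.5, max = 1.0),
  fact = runif(40, min = 0.0, max = 0.6)
)

# Run the same paired t-test within each country
tests <- lapply(split(toy, toy$country), function(d) {
  t.test(d$religious, d$fact, paired = TRUE)
})

# Collect the statistics of interest into one summary table
by_country <- data.frame(
  country = names(tests),
  mean_diff = sapply(tests, function(x) unname(x$estimate)),
  t = sapply(tests, function(x) unname(x$statistic)),
  df = sapply(tests, function(x) unname(x$parameter)),
  p = sapply(tests, function(x) x$p.value),
  row.names = NULL
)

print(by_country)
```

A single table like this also makes it easier to compare the size of the difference across countries at a glance.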
---
title: "Think Believe 1 (forced choice)"
output: 
  html_notebook:
    toc: true
    toc_float: true
---

```{r setup}
knitr::opts_chunk$set(echo = F, message = F)
```

In this notebook we analyze the first "think/believe" task, in which participants completed a series of fill-in-the-blanks by choosing between two options: "think" and "believe."


```{r}
source("./scripts/dependencies.R")
source("./scripts/custom_funs.R")
source("./scripts/var_recode_contrast.R")
```

```{r}
d1_raw <- read_xlsx("../data/ThinkBelieve1_organized.xlsx", sheet = "V1&V2 no dupes") %>%
  # eliminate one duplicate
  group_by(thb1_subj) %>%
  top_n(1, thb1_batc) %>% 
  ungroup() %>%
  mutate(thb1_ctry = factor(thb1_ctry, levels = levels_country))
```

```{r}
key1 <- read_xlsx("../data/ThinkBelieve1_organized.xlsx", sheet = 1)[1,] %>% 
  t() %>% 
  data.frame() %>% 
  rownames_to_column("question") %>%
  rename(question_text = ".") %>%
  # get rid of white space
  mutate(question_text = gsub("\\s+", " ", question_text)) %>%
  # hand code question categories
  mutate(category = case_when(
    grepl("final paper", question_text) |
      grepl("highway into town", question_text) |
      grepl("grocery store", question_text) |
      grepl("chemistry book", question_text) |
      grepl("cooking noodles", question_text) ~ "life fact",
    grepl("praying to God", question_text) |
      grepl("angels deliver", question_text) |
      grepl("go to Heaven", question_text) |
      grepl("changed water", question_text) |
      grepl("human sins", question_text) ~ "Christian religious",
    grepl("cycle of death", question_text) |
      grepl("Buddha found spiritual", question_text) |
      grepl("lotus flower bloomed", question_text) |
      grepl("ghosts suffer", question_text) |
      grepl("burning incense", question_text) ~ "Buddhist religious",
    grepl("moon goes around", question_text) |
      grepl("Barack Obama", question_text) |
      grepl("using batteries", question_text) |
      grepl("Brazil", question_text) |
      grepl("ancient Roman", question_text) ~ "well-known fact",
    grepl("octopus", question_text) |
      grepl("John Brown", question_text) |
      grepl("taller mountain", question_text) |
      grepl("species of fish", question_text) |
      grepl("contains more copper", question_text) ~ "esoteric fact",
    TRUE ~ NA_character_)) %>%
  mutate(category = factor(category, 
                           levels = c("Christian religious", 
                                      "Buddhist religious", 
                                      "well-known fact", 
                                      "esoteric fact", 
                                      "life fact")),
         super_cat = case_when(grepl("fact", category) ~ "fact",
                               grepl("religious", category) ~ "religious",
                               TRUE ~ NA_character_),
         super_cat = factor(super_cat, levels = c("religious", "fact"))) %>%
  rownames_to_column("order") %>%
  mutate(order = as.numeric(order),
         question_text = gsub("‚Äô", "'", question_text),
         question_text_short = gsub("^.*that ", "...", question_text),
         var_name = names(d1_raw[names(d1_raw) != "thb1_version"]))
```

```{r}
d1 <- d1_raw %>%
  filter(thb1_ctry %in% levels_country) %>%
  mutate(thb1_ctry = factor(thb1_ctry, levels = levels_country),
         thb1_demo_sex = factor(thb1_demo_sex,
                                levels = c("Male", "Female", "Other")), 
         thb1_demo_age = as.numeric(as.character(thb1_demo_age))) %>%
  # funs() is deprecated in dplyr; a formula lambda does the same job
  mutate_at(vars(thb1_demo_regp, thb1_demo_olang),
            ~ factor(., levels = c("NO", "YES"))) %>%
  mutate_at(vars(thb1_demo_rely, thb1_demo_impr, thb1_demo_imsn), 
            ~ factor(., levels = 1:7)) %>%
  mutate(thb1_demo_wors = factor(thb1_demo_wors, 
                                 levels = c("Never", 
                                            "Once a year or less",
                                            "A few times a year",
                                            "Once or twice a month",
                                            "Every week or more often")),
         thb1_demo_bgod = factor(thb1_demo_bgod,
                                 levels = c("Not at all believe",
                                            "Believe slightly",
                                            "Believe moderately",
                                            "Believe strongly")),
         thb1_demo_bbuh = factor(thb1_demo_bbuh,
                                 levels = c("Not at all believe",
                                            "Believe slightly",
                                            "Believe moderately",
                                            "Believe strongly")),
         thb1_demo_bosp = factor(thb1_demo_bosp,
                                 levels = c("Not at all believe",
                                            "Believe slightly",
                                            "Believe moderately",
                                            "Believe strongly")),
         thb1_demo_atsn = factor(thb1_demo_atsn,
                                 levels = c("There is no such thing as supernatural forces or beings",
                                            "We cannot know if there are supernatural forces and beings",
                                            "There might be supernatural forces and beings",
                                            "Supernatural forces and beings exist but we cannot know what they are like",
                                            "There definitely are supernatural forces and beings"))) %>%
  mutate_at(vars(thb1_demo_rely, thb1_demo_impr, thb1_demo_wors, thb1_demo_bgod, 
                 thb1_demo_bbuh, thb1_demo_bosp, thb1_demo_atsn, thb1_demo_imsn), 
            list(num = ~ as.numeric(.) - 1))  # named list (replaces deprecated funs()) adds *_num columns

contrasts(d1$thb1_ctry) = contrast_country
```

```{r}
d1_long <- d1 %>%
  gather(question, response, thb1_ghostshunger:thb1_obama) %>%
  mutate(think = ifelse(grepl("think", response) |
                          grepl("thought", response), T, F),
         believe = ifelse(grepl("belie", response), T, F),
         response_cat = case_when(believe == T ~ "believe",
                                  think == T ~ "think",
                                  TRUE ~ NA_character_),
         response_cat = factor(response_cat, levels = c("think", "believe"))) %>%
  left_join(key1 %>% select(-question) %>% rename(question = var_name))

contrasts(d1_long$thb1_ctry) = contrast_country
# contrasts(d1_long$category) = contrast_category
contrasts(d1_long$category) = contrast_category_orth
contrasts(d1_long$super_cat) = contrast_super_cat
```
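To illustrate how the pattern matching in the chunk above codes responses, here is a small base R sketch with made-up response strings. One detail worth noticing is that `grepl()` returns `FALSE` for a missing value, so a blank response ends up with `believe == FALSE` even though its `response_cat` is `NA`. Whether that matters for the analyses depends on how many responses are missing.

```r
# Hypothetical raw responses, including one missing value
responses <- c("believes", "believe", "thinks", "thought", NA)

# Same patterns as in the chunk above
think <- grepl("think", responses) | grepl("thought", responses)
believe <- grepl("belie", responses)

# Category label: "believe", "think", or NA when neither pattern matches
response_cat <- ifelse(believe, "believe",
                       ifelse(think, "think", NA_character_))

print(data.frame(responses, think, believe, response_cat))
```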

```{r}
# implement exclusion criteria and rename country variable
d1 <- d1 %>% 
  filter(thb1_ordr == "Yes", thb1_attn == "Pass") %>%
  rename(country = thb1_ctry)

d1_long <- d1_long %>% 
  filter(thb1_ordr == "Yes", thb1_attn == "Pass") %>%
  rename(country = thb1_ctry)
```


# Overview

From the preregistration ([link](https://aspredicted.org/p6iy3.pdf)):

> "Our overarching hypothesis for the present study is that [...] other languages will have an epistemic verb that is more likely to be used for religious attitude reports (similar to English “believe”) and a different epistemic verb that is more likely to be used for matter-of-fact attitude reports (similar to English “think”).
>
> For this study, we are examining five languages in five regions of interest: (i) Mandarin in China; (ii) Thai in Thailand; (iii) Bislama (an English-based creole language) on the Melanesian Island of Vanuatu; (iv) Fante in Ghana; and (v) American English in the Bay Area, California.
>
> We thus have five more specific sub-hypotheses. For each of the first four languages / regions of interest, we hypothesize that a set of words or phrases exists whose usage parallels the difference between usage of “think” and “believe” in American English, with one word or phrase (the “think” analogue) being used for more matter-of-fact attitude reports and the other (the “believe” analogue) being more likely to be used for religious attitude reports. That gives us our first four sub-hypotheses: that Mandarin, Thai, Bislama and Fante speakers will each use two different words in a manner parallel to the use of “think” and “believe” in an American English setting as identified by Heiphetz, Landers, and Van Leeuwen. Our fifth sub-hypothesis is that the Bay Area portion of the study will replicate the results of the earlier study of Heiphetz, Landers, and Van Leeuwen."


<p style="color:darkred">**KW EXECUTIVE SUMMARY (2020-01-19): We replicated the original finding in the US: participants were more likely to circle "believe" for religious than fact questions. We found the same pattern in all five countries/languages included in this study.**</p>

<p style="color:darkred">**The pattern was somewhat weaker in Ghana/Fante than in other countries/languages (though it was still significant), and the pattern was somewhat stronger in Thailand/Thai than in other countries/languages.**</p>


# Samples

Before we begin, it's important to note that we had unequal sample sizes by country:

```{r}
d1_raw %>% count(thb1_ctry)
```

However, `r d1_raw %>% filter(thb1_ordr == "No") %>% count() %>% as.numeric()` participants completed this task after completing other surveys, and an additional `r d1_raw %>% filter(thb1_ordr == "Yes", thb1_attn == "Fail") %>% count() %>% as.numeric()` failed the attention check. In the following analyses I will exclude these participants, leaving us with the following samples:

```{r}
d1 %>% count(country)
```

```{r}
sample_size_d1 <- d1 %>% 
  count(country) %>% 
  data.frame() %>%
  mutate(country_n = paste0(country, " (n=", n, ")"),
         country_n = reorder(country_n, as.numeric(country)))
```


# Plots

We'll begin by plotting responses of "think" (red) vs. "believe" (turquoise) to get an overall sense of any patterns in the data.

## By superordinate category

```{r, fig.width = 3, fig.asp = 0.5}
d1_long %>%
  left_join(sample_size_d1) %>%
  ggplot(aes(x = super_cat, 
             # put NAs on top of bar
             fill = factor(response_cat,
                           levels = c(NA, "think", "believe"), 
                           exclude = NULL))) +
  facet_grid(. ~ country_n, scales = "free", space = "free") +
  geom_bar(position = "fill", alpha = 0.7, color = "black", size = 0.1) +
  # geom_hline(yintercept = 0.5, lty = 2) +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1),
        legend.position = "top") +
  labs(x = "category", y = "proportion", fill = "response")
```

## By category

```{r, fig.width = 3, fig.asp = 0.5}
d1_long %>%
  left_join(sample_size_d1) %>%
  ggplot(aes(x = category, 
             # put NAs on top of bar
             fill = factor(response_cat,
                           levels = c(NA, "think", "believe"), 
                           exclude = NULL))) +
  facet_grid(. ~ country_n, scales = "free", space = "free") +
  geom_bar(position = "fill", alpha = 0.7, color = "black", size = 0.1) +
  # geom_hline(yintercept = 0.5, lty = 2) +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1),
        legend.position = "top") +
  labs(x = "category", y = "proportion", fill = "response")
```

## By question

```{r, fig.width = 6, fig.asp = 0.7}
d1_long %>%
  left_join(sample_size_d1) %>%
  ggplot(aes(x = reorder(str_wrap(question_text_short, 40), order), 
             # put NAs on top of bar
             fill = factor(response_cat,
                           levels = c(NA, "think", "believe"), 
                           exclude = NULL))) +
  facet_grid(country_n ~ category, scales = "free", space = "free") +
  geom_bar(position = "fill", alpha = 0.7, color = "black", size = 0.1) +
  # geom_hline(yintercept = 0.5, lty = 2) +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1),
        legend.position = "top",
        plot.margin = (unit(c(0.2, 0.2, 0.2, 1.8), "cm"))) +
  labs(x = "question", y = "proportion", fill = "response")
```


# Analysis: KW without looking at preregistration

Here's how I analyzed the data before looking at the preregistration. I think these analyses are valuable because they're a little more efficient than the preregistered analyses -- no need for follow-up tests -- and they directly test the question of whether the effect of interest varies across countries/languages.

Technical note: Unless specified otherwise, all of these analyses use "effect coding" for categorical variables (e.g., country, category of question) -- meaning that each country/language is compared to the "grand mean" collapsing across all countries/languages. Because of degrees of freedom issues, each analysis only compares 4 of the 5 countries to the grand mean -- by default, I've left out the comparison of the US/English to the grand mean, but stats for that comparison could easily be calculated (if we left out another country/language instead). This is just to say that you won't see statements like "The effect was exaggerated in the US relative to other countries," although they might be true.
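
To make the effect-coding point concrete, here is a minimal sketch using base R's `contr.sum()` on simulated data. The data frame `toy`, the outcome `y`, and the level ordering (US last, so that it is the omitted comparison) are assumptions for illustration only; the notebook's own contrast objects are defined elsewhere.

```r
# Toy illustration of effect ("sum") coding for a 5-level factor.
# All names and values here are made up for illustration.
set.seed(1)
levs <- c("Ghana", "Thailand", "China", "Vanuatu", "US")  # US last = omitted level
toy <- data.frame(country = factor(rep(levs, each = 20), levels = levs))
toy$y <- rnorm(nrow(toy), mean = as.integer(toy$country) / 10)

contrasts(toy$country) <- contr.sum(5)
fit <- lm(y ~ country, data = toy)

# Under sum coding, the intercept is the unweighted mean of the country means,
# and each coefficient is one country's deviation from that grand mean.
group_means <- sapply(split(toy$y, toy$country), mean)
grand_mean <- mean(group_means)

# The omitted level's deviation is minus the sum of the other four.
us_dev <- -sum(coef(fit)[-1])
c(intercept = unname(coef(fit)[1]), grand_mean = grand_mean, us_dev = us_dev)
```

This is why the US comparison can be recovered even though it never appears as its own row in a regression table.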

## KW Analysis #1

First, I used a mixed effects logistic regression predicting how likely a participant was to circle "believe" based on the superordinate category of the question ("religious" questions or "fact" questions), the country they were in/language they were using (US/English, Ghana/Fante, Thailand/Thai, China/Mandarin, or Vanuatu/Bislama), and an interaction between them, with a maximal random effects structure (random intercepts and slopes by subject, and random intercepts by question). This analysis gives me a sense of (1) whether participants were more likely to circle "believe" for religious questions than fact questions, and (2) whether this tendency varied by country/language, controlling for the fact that the overall rates of circling "believe" might vary by country/language (and accounting for individual differences and differences across individual questions).

```{r, echo = T}
r1.1 <- lmer(believe ~ super_cat * country 
             + (1 + super_cat | thb1_subj) + (1 | question),
             data = d1_long)
```
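
One thing to keep in mind: `lmer()` fits a linear mixed model, so the chunk above treats the 0/1 `believe` outcome as a linear probability model. If a logistic link is what's wanted, the usual `lme4` route is `glmer()` with `family = binomial`. Here is a minimal sketch on simulated data; the data frame `toy`, its column names, the effect sizes, and the simplified random effects (intercepts only) are all assumptions for illustration, not the notebook's actual data or model.

```r
library(lme4)  # assumed installed; the notebook already relies on it for lmer()

# Simulated data: 40 hypothetical participants x 10 hypothetical questions.
set.seed(2)
toy <- expand.grid(subj = factor(1:40), question = factor(1:10))
# Questions 1-5 are "fact", 6-10 are "religious" (arbitrary toy assignment).
toy$super_cat <- factor(ifelse(as.integer(toy$question) <= 5, "fact", "religious"),
                        levels = c("fact", "religious"))
subj_int <- rnorm(40, sd = 0.5)   # participant-level random intercepts
quest_int <- rnorm(10, sd = 0.3)  # question-level random intercepts
eta <- -1 + 2 * (toy$super_cat == "religious") +
  subj_int[as.integer(toy$subj)] + quest_int[as.integer(toy$question)]
toy$believe <- rbinom(nrow(toy), size = 1, prob = plogis(eta))

# Logistic mixed model: coefficients are on the log-odds scale.
m_logit <- glmer(believe ~ super_cat + (1 | subj) + (1 | question),
                 data = toy, family = binomial)
fixef(m_logit)
```

Coefficients from the two approaches are not directly comparable: the linear version is in probability units and the logistic version is in log-odds.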

```{r}
regtab_fun(r1.1, std_beta = T) %>% regtab_style_fun(row_emph = c(2, 7:10))
```

```{r, include = F}
regtab_ran_fun(r1.1, subj_var = "thb1_subj") %>% regtab_style_fun()
```

The effects of primary interest are in bold:

- **Category (religious)**: Collapsing across countries/languages, participants were indeed more likely to say "believe" for "religious" questions. This effect is much larger than any of the differences across countries/languages (as you can see by comparing the β values for different effects).
- Country (Gh.): Participants in Ghana were generally more likely than other participants to say "believe," collapsing across question categories.
- Country (Th.): Participants in Thailand were generally less likely than other participants to say "believe," collapsing across question categories.
- Country (Ch.): Participants in China were no more or less likely than other participants to say "believe," collapsing across question categories.
- Country (Vt.): Participants in Vanuatu were no more or less likely than other participants to say "believe," collapsing across question categories.
- **Category (religious) x Country (Gh.)**: The difference in rates of "believe" responses between question categories was smaller in Ghana than in other countries.
- **Category (religious) x Country (Th.)**: The difference in rates of "believe" responses between question categories was larger in Thailand than in other countries.
- **Category (religious) x Country (Ch.)**: The difference in rates of "believe" responses between question categories was no smaller or larger in China than in other countries.
- **Category (religious) x Country (Vt.)**: The difference in rates of "believe" responses between question categories was no smaller or larger in Vanuatu than in other countries.

**Take-away: The predicted effect is evident in this dataset. It appears to be exaggerated in Thailand and diminished in Ghana.**

## KW Analyses #1a-1e (by country)

Next, I did this same analysis within each country/language alone (using the most maximal random effect structure that converged across all countries/languages). 

```{r, echo = T}
# note: using most maximal common random effects structure
r1.1_us <- lmer(believe ~ super_cat + 
                  # (1 + super_cat | thb1_subj) + (1 | question), 
                  # (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
                  (1 | thb1_subj) + (1 | question),
                # (1 + super_cat | thb1_subj),
                data = d1_long %>% filter(country == "US"))

r1.1_gh <- lmer(believe ~ super_cat + 
                  # (1 + super_cat | thb1_subj) + (1 | question),
                  # (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
                  (1 | thb1_subj) + (1 | question),
                # (1 + super_cat | thb1_subj),
                data = d1_long %>% filter(country == "Ghana"))

r1.1_th <- lmer(believe ~ super_cat + 
                  # (1 + super_cat | thb1_subj) + (1 | question), 
                  # (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
                  (1 | thb1_subj) + (1 | question),
                # (1 + super_cat | thb1_subj),
                data = d1_long %>% filter(country == "Thailand"))

r1.1_ch <- lmer(believe ~ super_cat + 
                  # (1 + super_cat | thb1_subj) + (1 | question), 
                  # (1 + super_cat || thb1_subj) + (1 | question), # failed to converge
                  (1 | thb1_subj) + (1 | question),
                # (1 + super_cat | thb1_subj),
                data = d1_long %>% filter(country == "China"))

r1.1_vt <- lmer(believe ~ super_cat + 
                  # (1 + super_cat | thb1_subj) + (1 | question), # failed to converge
                  # (1 + super_cat || thb1_subj) + (1 | question),
                  (1 | thb1_subj) + (1 | question),
                # (1 + super_cat | thb1_subj),
                data = d1_long %>% filter(country == "Vanuatu"))
```

```{r}
bind_rows(regtab_fun(r1.1_us) %>% mutate(Country = "US"),
          regtab_fun(r1.1_gh) %>% mutate(Country = "Ghana"),
          regtab_fun(r1.1_th) %>% mutate(Country = "Thailand"),
          regtab_fun(r1.1_ch) %>% mutate(Country = "China"),
          regtab_fun(r1.1_vt) %>% mutate(Country = "Vanuatu")) %>%
  select(Country, everything()) %>%
  regtab_style_fun(row_emph = seq(2, 10, 2)) %>%
  collapse_rows(1)
```

The effects of primary interest are in bold, and **the take-away is clear: In every country/language, participants were more likely to say "believe" in "religious" questions than in "fact" questions**.


## KW Analysis #2

In this analysis, I treated country/language as a random rather than fixed effect (with participants nested within countries). (Note that I had to use a simpler random effects structure in order to get the model to converge.)

```{r, echo = T}
r1.2 <- lmer(believe ~ super_cat 
             # + (1 + super_cat | country/thb1_subj) + (1 | question), # failed to converge
             # + (1 + super_cat || country/thb1_subj) + (1 | question), # failed to converge
             # + (1 + super_cat | country/thb1_subj), # failed to converge
             + (1 | country/thb1_subj) + (1 | question),
             data = d1_long)
```

```{r}
regtab_fun(r1.2) %>% regtab_style_fun(row_emph = 2)
```

```{r, include = F}
regtab_ran_fun(r1.2, subj_var = "thb1_subj") %>% regtab_style_fun()
```

The effect still holds.

## KW Analysis #3

Finally, I ran a version of this first model looking at 5 categories of questions (rather than 2 superordinate categories): Christian religious, Buddhist religious, well-known fact, esoteric fact, and personal fact. I compared these categories using planned orthogonal contrasts.
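
For reference, here is a minimal sketch of what a set of planned orthogonal contrasts over these five categories could look like. The weights, the row order, and the object name `contrast_sketch` are assumptions for illustration (the contrast matrix actually used in this notebook is defined elsewhere); the column names just echo the predictor names in the table code below.

```r
# Assumed weights for four planned contrasts over five question categories.
# Row order and the weights themselves are illustrative guesses.
cats <- c("christian", "buddhist", "well_known", "esoteric", "personal")
contrast_sketch <- cbind(
  relig_fact = c( 3,  3, -2, -2, -2),  # religious vs. fact
  relig_C_B  = c( 1, -1,  0,  0,  0),  # Christian vs. Buddhist religious
  fact_WE_L  = c( 0,  0,  1,  1, -2),  # well-known & esoteric vs. personal fact
  fact_W_E   = c( 0,  0,  1, -1,  0)   # well-known vs. esoteric fact
)
rownames(contrast_sketch) <- cats

colSums(contrast_sketch)    # each contrast sums to zero
crossprod(contrast_sketch)  # off-diagonal zeros = mutually orthogonal
```

With five categories there are four degrees of freedom, so four mutually orthogonal contrasts use up all of the between-category information.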

```{r, echo = T}
r1.3 <- lmer(believe ~ category * country 
             # + (1 + category | thb1_subj) + (1 | question), # failed to converge
             + (1 + category || thb1_subj) + (1 | question),
             # + (1 + category | thb1_subj), 
             # + (1 + category || thb1_subj), 
             # + (1 | thb1_subj) + (1 | question),
             data = d1_long)
```

```{r}
regtab_fun(r1.3, 
           predictor_var1 = "category_relig_fact", 
           predictor_name1 = "Category (Religious vs. fact)",
           predictor_var2 = "category_relig_C_B",
           predictor_name2 = "Category (Christian vs. Buddhist religious)",
           predictor_var3 = "category_fact_WE_L",
           predictor_name3 = "Category (well-known & esoteric vs. personal fact)",
           predictor_var4 = "category_fact_W_E",
           predictor_name4 = "Category (well-known vs. esoteric fact)") %>% 
  regtab_style_fun(row_emph = c(2:5, 10:25)) %>%
  group_rows("Intercept", start_row = 1, end_row = 1) %>%
  group_rows("Category comparisons", start_row = 2, end_row = 5) %>%
  group_rows("Country comparisons", start_row = 6, end_row = 9) %>%
  group_rows("Interactions: Ghana", start_row = 10, end_row = 13) %>%
  group_rows("Interactions: Thailand", start_row = 14, end_row = 17) %>%
  group_rows("Interactions: China", start_row = 18, end_row = 21) %>%
  group_rows("Interactions: Vanuatu", start_row = 22, end_row = 25)
```

```{r, include = F}
regtab_ran_fun(r1.3, subj_var = "thb1_subj") %>% regtab_style_fun()
```

The first orthogonal contrast compared the two "religious" categories to the three "fact" categories ("Category (Religious vs. fact)"). This parallels the previous analyses, and the results are similar: Overall, participants were more likely to circle "believe" for religious questions than fact questions, and this tendency was diminished in Ghana and exaggerated in Thailand.

The second orthogonal contrast compared Christian to Buddhist "religious" questions. Overall, participants were more likely to circle "believe" for Christian questions, and this tendency was exaggerated in Ghana and Vanuatu (which were predominantly Christian samples) and diminished in Thailand and China (which were more Buddhist samples).

The third orthogonal contrast compared well-known and esoteric facts, on the one hand, to personal facts, on the other. Overall, participants were more likely to circle "believe" for well-known and esoteric facts, and this tendency was diminished in Ghana and China, and exaggerated in Thailand and Vanuatu.

The fourth orthogonal contrast compared well-known to esoteric facts. Overall, participants were more likely to circle "believe" for well-known facts, and this tendency was diminished in China, and exaggerated in Vanuatu.

Note that these findings statistically control for differences across samples in the overall rate of circling "believe" (which was generally higher in Ghana and lower in Thailand).


# Analysis: Based on preregistration

From preregistration:

> "Survey 1: We will conduct a 5 (Site: China vs. Thailand vs. Vanuatu vs. Ghana vs. United States) x 2 (Statement Type: religion vs. fact) mixed ANOVA with repeated measures on the second factor and the proportion of trials on which participants completed sentences using a form the word “believe” (or its respective translation) as the dependent measure. To look for finer-grained differences between different religious and factual statements, we will also conduct a 5 (Site: China vs. Thailand vs. Vanuatu vs. Ghana vs. United States) x 5 (Statement Type: Buddhist religious statements vs. Christian religious statements vs. life facts vs. well-known facts vs. esoteric facts) mixed ANOVA with repeated measures on the second factor and the proportion of trials on which participants completed sentences using a form of the word “believe” (or its respective translation) as the dependent measure. In all cases where omnibus ANOVAs are significant, we will conduct pairwise analyses comparing each statement type with each other statement type and each site with each other site."

```{r, echo = T}
d1_anova <- d1_long %>%
  distinct(thb1_subj, country, super_cat, question, believe) %>%
  group_by(thb1_subj, country, super_cat) %>%
  summarise(prop_believe = mean(believe)) %>%
  ungroup() %>%
  mutate(thb1_subj = factor(thb1_subj))

contrasts(d1_anova$country) <- contrast_country
contrasts(d1_anova$super_cat) <- contrast_super_cat
```

## Prereg Analysis #1

Here is the first preregistered analysis: a 5 (country) x 2 (question category) mixed ANOVA with repeated measures on the second factor and the proportion of trials on which participants circled "believe" as the DV.

```{r, echo = T}
r1.4 <- d1_anova %>%
  anova_test(dv = prop_believe, 
             wid = thb1_subj, 
             between = country, 
             within = super_cat)

get_anova_table(r1.4)
```
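
For anyone who wants to sanity-check the structure of this design without `rstatix`, here is a rough base R sketch of the same kind of mixed ANOVA using `aov()` with an `Error()` term. The data are simulated, and the names `toy` and `fit_aov` and the three made-up sites are assumptions for illustration. With unequal group sizes, the sums of squares from `aov()` may not match what `anova_test()` reports, so treat this as a structural illustration rather than a replacement.

```r
# Toy data: 30 participants in 3 hypothetical sites, 2 question categories each.
set.seed(3)
toy <- expand.grid(subj = factor(1:30),
                   super_cat = factor(c("fact", "religious")))
toy$country <- factor(c("A", "B", "C")[(as.integer(toy$subj) - 1) %/% 10 + 1])
# Simulated proportions, clamped to the [0, 1] range.
toy$prop_believe <- pmin(pmax(
  0.3 + 0.4 * (toy$super_cat == "religious") + rnorm(nrow(toy), sd = 0.1),
  0), 1)

# Between-subjects factor: country; within-subjects factor: super_cat.
fit_aov <- aov(prop_believe ~ country * super_cat + Error(subj / super_cat),
               data = toy)
summary(fit_aov)
```

The `country` effect is tested in the between-participant stratum, while `super_cat` and the interaction are tested in the within-participant stratum.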

This analysis aligns with the regressions above, suggesting that participants' tendency to circle "believe" varied by country/language (`country`) and by question category (`super_cat`), and the difference between question categories varied across countries/languages (i.e., there was an interaction: `country:super_cat`).

The preregistration indicated that we'd conduct pairwise follow-up analyses comparing the two question categories and comparing pairs of countries/languages -- but I don't really think we're interested in comparing pairs of countries/languages, so I'm going to skip that for now. Instead, I'll compare the two question categories within each country/language (to explore the significant interaction).

Here we go:

### Comparing question categories

```{r, echo = T}
r1.5a <- t.test(prop_believe ~ super_cat, paired = T, d1_anova); r1.5a
```

Collapsing across countries/languages, **participants circled significantly more "believe" responses for questions in the religious category (`r round(100 * mean(d1_anova$prop_believe[d1_anova$super_cat == "religious"], na.rm = T))`%) than they did for questions in the fact category (`r round(100 * mean(d1_anova$prop_believe[d1_anova$super_cat == "fact"], na.rm = T))`%)**.
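
A note on what a paired t-test returns: its `estimate` is a single number (the mean of the within-participant differences), so per-category means have to be computed separately. Here is a minimal sketch with simulated proportions in wide format, one row per participant. The data frame `wide` and its values are made up for illustration.

```r
# Simulated per-participant proportions of "believe" responses (wide format).
set.seed(4)
n <- 25
wide <- data.frame(subj = 1:n, fact = runif(n, 0.1, 0.5))
wide$religious <- pmin(wide$fact + runif(n, 0.2, 0.5), 1)

# Paired comparison: rows are aligned by participant.
tt <- t.test(wide$religious, wide$fact, paired = TRUE)
length(tt$estimate)  # a paired test reports one estimate, the mean difference

# Per-category means, for reporting alongside the test.
cat_means <- colMeans(wide[, c("religious", "fact")])
cat_means
```

Passing the two columns explicitly also keeps the pairing visible, since each row holds both of one participant's scores.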

### Comparing question categories within countries/languages

```{r, echo = T}
# US
r1.5b_us <- t.test(prop_believe ~ super_cat, paired = T,
                   d1_anova %>% filter(country == "US")); r1.5b_us

# Ghana
r1.5b_gh <- t.test(prop_believe ~ super_cat, paired = T,
                   d1_anova %>% filter(country == "Ghana")); r1.5b_gh

# Thailand
r1.5b_th <- t.test(prop_believe ~ super_cat, paired = T,
                   d1_anova %>% filter(country == "Thailand")); r1.5b_th

# China
r1.5b_ch <- t.test(prop_believe ~ super_cat, paired = T,
                   d1_anova %>% filter(country == "China")); r1.5b_ch

# Vanuatu
r1.5b_vt <- t.test(prop_believe ~ super_cat, paired = T,
                   d1_anova %>% filter(country == "Vanuatu")); r1.5b_vt
```

**The difference between question categories was significant in each country/language considered alone.**


# Analysis: Religion and religiosity

## Demographics

First, let's just look at how people in different countries replied to the relevant questions. 

### `thb1_demo_regp`: "Are you a part of any religious group?"

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = country_n, 
             # put NAs on top of bar
             fill = factor(thb1_demo_regp,
                           levels = c(NA, "NO", "YES"), 
                           exclude = NULL))) +
  geom_bar() +
  labs(x = "country", y = "count", 
       fill = "Are you a part of any religious group?") +
  theme(legend.position = "top")
```

### `thb1_demo_rely`: "From 1 to 7, how religious are you? (1 = not religious at all, 7 = extremely religious)"

Seems to have been omitted in Thailand?

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = as.numeric(thb1_demo_rely))) +
  facet_grid(~ country_n) +
  geom_histogram(binwidth = 1) +
  scale_x_continuous(breaks = 1:7, minor_breaks = NULL) +
  labs(x = "From 1 to 7, how religious are you?", 
       y = "count")
```

### `thb1_demo_impr`: "From 1 to 7, how important to you is your religious practice?  (1 = not important at all, 7 = of utmost importance)"

Seems to have been omitted in Thailand?

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = as.numeric(thb1_demo_impr))) +
  facet_grid(~ country_n) +
  geom_histogram(binwidth = 1) +
  scale_x_continuous(breaks = 1:7, minor_breaks = NULL) +
  labs(x = "From 1 to 7, how important to you is your religious practice?", 
       y = "count")
```

### `thb1_demo_wors`: "How often do you attend places of worship?"

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = thb1_demo_wors)) +
  facet_grid(~ country_n) +
  geom_bar() +
  labs(x = "How often do you attend places of worship?", 
       y = "count") +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1))
```

### `thb1_demo_bgod`: "What best describes your level of belief in God?"

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = thb1_demo_bgod)) +
  facet_grid(~ country_n) +
  geom_bar() +
  labs(x = "What best describes your level of belief in God?", 
       y = "count") +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1))
```

### `thb1_demo_bbuh`: "What best describes your level of belief in Buddha?"

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = thb1_demo_bbuh)) +
  facet_grid(~ country_n) +
  geom_bar() +
  labs(x = "What best describes your level of belief in Buddha?", 
       y = "count") +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1))
```

### `thb1_demo_bosp`: "What best describes your level of belief in another spiritual being (other than God or Buddha)?"

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = thb1_demo_bosp)) +
  facet_grid(~ country_n) +
  geom_bar() +
  labs(x = "What best describes your level of belief in another spiritual being (other than God or Buddha)?", 
       y = "count") +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1))
```

### `thb1_demo_atsn`: "What best describes your attitude towards the supernatural?"

```{r, fig.width = 3.5, fig.asp = 0.8}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = thb1_demo_atsn)) +
  facet_grid(~ country_n) +
  geom_bar() +
  labs(x = "What best describes your attitude towards the supernatural?", 
       y = "count") +
  theme(axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1))
```

### `thb1_demo_imsn`: "From 1 to 7, how important to you is your attitude toward the supernatural? (1 = not important at all, 7 = of utmost importance)"

```{r}
d1 %>% 
  left_join(sample_size_d1) %>%
  ggplot(aes(x = as.numeric(thb1_demo_imsn))) +
  facet_grid(~ country_n) +
  geom_histogram(binwidth = 1) +
  scale_x_continuous(breaks = 1:7, minor_breaks = NULL) +
  labs(x = "From 1 to 7, how important to you is your attitude toward the supernatural?", 
       y = "count")
```

## Analyses

Now, let's look at how responses to our think/believe questions might have varied depending on religiosity/etc. For now, I'll just focus on a couple of variables that seem to have been answered in reasonable ways.

### `thb1_demo_rely`: "From 1 to 7, how religious are you? (1 = not religious at all, 7 = extremely religious)"

```{r, echo = T}
r1.6 <- lmer(believe ~ super_cat * country * thb1_demo_rely_num
             + (1 + super_cat | thb1_subj) + (1 | question),
             data = d1_long %>% 
               filter(country != "Thailand") %>%
               mutate(thb1_demo_rely_num = scale(thb1_demo_rely_num)),
             contrasts = list(country = "contr.sum"))
```

```{r}
regtab_fun(r1.6, std_beta = T, 
           country_var1 = "country1", country_name1 = "Country (US)",
           country_var2 = "country2", country_name2 = "Country (Ghana)",
           country_var3 = "country3", country_name3 = "Country (China)",
           predictor_var1 = "thb1_demo_rely_num", 
           predictor_name1 = "How religious are you?") %>% 
  regtab_style_fun(row_emph = c(10, 14:16))
```

This analysis suggests that greater religiosity was associated with an increased distinction between religious and fact questions. (Note that this analysis omits participants from Thailand, who did not answer this question about religiosity.)
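
A small technical aside on the standardization step: `scale()` returns a one-column matrix (with centering and scaling attributes) rather than a plain vector. That usually works fine inside a model formula, but wrapping it in `as.numeric()` gives an ordinary numeric column with the same values. A minimal sketch, using made-up ratings (`rely_raw` is hypothetical):

```r
# Toy 1-7 ratings (made up); NA mimics a skipped question.
rely_raw <- c(1, 3, 4, 4, 6, 7, NA)

z_mat <- scale(rely_raw)              # 1-column matrix with centering attributes
z_vec <- as.numeric(scale(rely_raw))  # plain numeric vector, same values

is.matrix(z_mat)
round(c(mean = mean(z_vec, na.rm = TRUE), sd = sd(z_vec, na.rm = TRUE)), 10)
```

Because the standardization here happens after Thailand is filtered out, the resulting z-scores are relative to the remaining four samples.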

```{r}
d1_long %>% 
  filter(country != "Thailand") %>%
  group_by(country, thb1_subj, thb1_demo_rely_num, super_cat) %>%
  summarise(believe_prop = mean(believe, na.rm = T)) %>%
  ungroup() %>%
  ggplot(aes(x = thb1_demo_rely_num, y = believe_prop, color = super_cat)) +
  facet_grid(. ~ country) +
  geom_jitter(alpha = 0.2, width = 0.1, height = 0.02) +
  geom_smooth(method = "lm") +
  scale_x_continuous(breaks = 0:6, labels = levels(d1$thb1_demo_rely)) +
  theme(legend.position = "top") +
  labs(x = "How religious are you?", y = "Proportion 'believe' responses",
       color = "Category")
```

```{r}
d1_long %>% 
  filter(country != "Thailand") %>%
  group_by(country, thb1_subj, thb1_demo_rely_num, super_cat) %>%
  summarise(believe_prop = mean(believe, na.rm = T)) %>%
  ungroup() %>%
  spread(super_cat, believe_prop) %>%
  mutate(diff = religious - fact) %>%
  ggplot(aes(x = thb1_demo_rely_num, y = diff)) +
  facet_grid(. ~ country) +
  geom_jitter(alpha = 0.2, width = 0.1, height = 0.02) +
  geom_smooth(method = "lm") +
  scale_x_continuous(breaks = 0:6, labels = levels(d1$thb1_demo_rely)) +
  theme(legend.position = "top") +
  labs(x = "How religious are you?", 
       y = "Difference in proportion 'believe' responses\n(religious questions - fact questions)",
       color = "Category")
```

### `thb1_demo_impr`: "From 1 to 7, how important to you is your religious practice?  (1 = not important at all, 7 = of utmost importance)"

```{r, echo = T}
r1.7 <- lmer(believe ~ super_cat * country * thb1_demo_impr_num
             + (1 + super_cat | thb1_subj) + (1 | question),
             data = d1_long %>% 
               filter(country != "Thailand") %>%
               mutate(thb1_demo_impr_num = scale(thb1_demo_impr_num)),
             contrasts = list(country = "contr.sum"))
```

```{r}
regtab_fun(r1.7, std_beta = T, 
           country_var1 = "country1", country_name1 = "Country (US)",
           country_var2 = "country2", country_name2 = "Country (Ghana)",
           country_var3 = "country3", country_name3 = "Country (China)",
           predictor_var1 = "thb1_demo_impr_num", 
           predictor_name1 = "How important is your religious practice?") %>% 
  regtab_style_fun(row_emph = c(10, 14:16))
```

This analysis suggests that more importance placed on religious practice was associated with an increased distinction between religious and fact questions. (Note that this analysis omits participants from Thailand, who did not answer this question about religiosity.)

```{r}
d1_long %>% 
  filter(country != "Thailand") %>%
  group_by(country, thb1_subj, thb1_demo_impr_num, super_cat) %>%
  summarise(believe_prop = mean(believe, na.rm = T)) %>%
  ungroup() %>%
  ggplot(aes(x = thb1_demo_impr_num, y = believe_prop, color = super_cat)) +
  facet_grid(. ~ country) +
  geom_jitter(alpha = 0.2, width = 0.1, height = 0.02) +
  geom_smooth(method = "lm") +
  scale_x_continuous(breaks = 0:6, labels = levels(d1$thb1_demo_impr)) +
  theme(legend.position = "top") +
  labs(x = "How important is your religious practice?", y = "Proportion 'believe' responses",
       color = "Category")
```

```{r}
d1_long %>% 
  filter(country != "Thailand") %>%
  group_by(country, thb1_subj, thb1_demo_impr_num, super_cat) %>%
  summarise(believe_prop = mean(believe, na.rm = T)) %>%
  ungroup() %>%
  spread(super_cat, believe_prop) %>%
  mutate(diff = religious - fact) %>%
  ggplot(aes(x = thb1_demo_impr_num, y = diff)) +
  facet_grid(. ~ country) +
  geom_jitter(alpha = 0.2, width = 0.1, height = 0.02) +
  geom_smooth(method = "lm") +
  scale_x_continuous(breaks = 0:6, labels = levels(d1$thb1_demo_impr)) +
  theme(legend.position = "top") +
  labs(x = "How important is your religious practice?", 
       y = "Difference in proportion 'believe' responses\n(religious questions - fact questions)",
       color = "Category")
```

### `thb1_demo_wors`: "How often do you attend places of worship?"

```{r, echo = T}
r1.8 <- lmer(believe ~ super_cat * country * thb1_demo_wors_num
             + (1 + super_cat | thb1_subj) + (1 | question),
             data = d1_long %>% 
               mutate(thb1_demo_wors_num = scale(thb1_demo_wors_num)))
```

```{r}
regtab_fun(r1.8, std_beta = T, 
           predictor_var1 = "thb1_demo_wors_num", 
           predictor_name1 = "How often do you attend places of worship?") %>% 
  regtab_style_fun(row_emph = c(12, 17:20))
```

This analysis suggests that more frequent attendance at places of worship was associated with an increased distinction between religious and fact questions. 

```{r}
d1_long %>% 
  group_by(country, thb1_subj, thb1_demo_wors_num, super_cat) %>%
  summarise(believe_prop = mean(believe, na.rm = T)) %>%
  ungroup() %>%
  ggplot(aes(x = thb1_demo_wors_num, y = believe_prop, color = super_cat)) +
  facet_grid(. ~ country) +
  geom_jitter(alpha = 0.2, width = 0.1, height = 0.02) +
  geom_smooth(method = "lm") +
  scale_x_continuous(breaks = 0:4, labels = levels(d1$thb1_demo_wors)) +
  theme(legend.position = "top",
        axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1)) +
  labs(x = "How often do you attend places of worship?", 
       y = "Proportion 'believe' responses",
       color = "Category")
```

```{r}
d1_long %>% 
  group_by(country, thb1_subj, thb1_demo_wors_num, super_cat) %>%
  summarise(believe_prop = mean(believe, na.rm = T)) %>%
  ungroup() %>%
  spread(super_cat, believe_prop) %>%
  mutate(diff = religious - fact) %>%
  ggplot(aes(x = thb1_demo_wors_num, y = diff)) +
  facet_grid(. ~ country) +
  geom_jitter(alpha = 0.2, width = 0.1, height = 0.02) +
  geom_smooth(method = "lm") +
  scale_x_continuous(breaks = 0:4, labels = levels(d1$thb1_demo_wors)) +
  theme(legend.position = "top",
        axis.text.x = element_text(angle = 45, hjust = 1, vjust = 1)) +
  labs(x = "How often do you attend places of worship?", 
       y = "Difference in proportion 'believe' responses\n(religious questions - fact questions)",
       color = "Category")
```




